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CTN ' The set of the fixed points of the Hopfield type network is under investigation. The connection 

O^ , matrix of the network is constructed according to the Hebb rule from the set of memorized patterns 

which are treated as distorted copies of the standard-vector. It is found that the dependence of the 
set of the fixed points on the value of the distortion parameter can be described analytically. The 
Ph ■ obtained results are interpreted in the terms of neural networks and the Ising model. 
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I. INTRODUCTION 

The problem of maximization of a symmetric form 

P{^) = Elj=i Jij(^i<^j -" max, a, = {±1}, 
(7 = (cTi,.. ■,an), Jij = Jjj, i,3 = 1,2,.. .,n, 

which is quadratic in spin variables Ui, is under investigation. This problem arises in the Ising model, in the surface 
physics, wihe theory of optimal coding, in factor analysis, in the theory of neural networks and in the optimization 



I 

^3 . ,,H7, 

(-H , problemaHTJ. The aim is to obtain an effective method for the search of the global maximum of the functional and a 

Q ■ constructive description of the set of its local maxima. 

Q ' The solution of the problem (1) is sought among the n-dimensional vectors a. These vectors define 2" configurations; 
in what follows they are called the configuration vectors. The configuration vector which gives the solution of the 

I problem (1) is called the ground state. 

^ , Let's settle with the choice of the diagonal elements of the matrix J = {Jij)ij=i. Since for any configuration vector 

t~ — ■ the diagonal gives the same contribution to the value of the functional, the local maxima of the functional do not 

On ' depend on Ja. Consequently, Ju can be chosen in an arbitrary suitable form. In the same time, the set of the local 

^~~^ maxima of the functional F is the same as the set of the fixed points of a neural network whose connection matrix 

^5^ differs from J only by zero diagonal elementsn. In what follows, we call J the connection matrix irrespective of its 

^1^ . diagonal. And with regard to neural networks we have in mind that its diagonal elements are equal zero. 

^y. ' Without the loss of generality J can be considered as a positive semidefinite matrix. (It always can be achieved by 
a suitable choice of its diagonal elements.) Then the matrix J can be presented as a product 



S'^ ■ S, (2) 



'"^ ' where S is a (p x n)-matrix with real matrix elements. The number of its rows is equal to the rank of the matrix J 
P3 ' and it does not exceed n — 1: 

o 

O ' p = rank J < n — 1. 



The representation (2) for the connection matrix plays an important role in the theory of neural networks. If all the 
elements of the matrix S arc equal {±1}, the connection matrix is called the Hebb matrix and the related neural 
network is called the Hopfield network. Of course, in general case when according to Eq. (2) the matrix S is obtained 
from an arbitrary symmetric matrix J, not necessarily all the elements of S are equal {±1}. By analogy with the 
Hopfield model, in this case Eq.(2) is called the Hebb type representation and the related neural network is called the 
Hopfield type network. Summarizing, we can say that the local maxima of the problem (1) are the fixed points of a 
Hopfield type network. 



'^For details seeu. 



We investigate the problem (1) in the case of the connection matrix constructed with regard to the Hebb-hke rule 
(2) from the {p x n)-matrix S of the form 



S = 
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where x is an arbitrary real number. The connection matrix J has the form: 
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According to the conventional neural network tradition, we treat the n-dimensional vectors s^" , which are the rows of 
the matrix S, as p generalized memorized patterns embedded in the network memory. Then the following meaningful 
interpretation of the problem can be suggested: the network had to be learned by p-time showing of the standard 

£»-(l,l,...,l), 

but an error crept into the learning process and in fact the network was learned with the help of p distorted copies s*^'' 
of the standard e{n) : 

s<') = (!,...,!, 1-x, !,...,!), I = 1,2,..., p. (4) 



The problem (l)-(3) will be called the basic model. In what follows we present its generalization. 

When X is equal zero, the network is found to be learned by p copies of the standard e{n). It is well-known that 
in this case the vector e*(n) itself is the ground state and the functional has no other local maxima. For continuity 
reasons, it is clear that the same is true for a sufficiently small distortion x. But when x increases the ground state 
changes. For the problem (l)-(3) we succeeded in obtaining the analytical description of the dependence of the ground 
state on the value of the distortion parameter. 

In n the main mathematical results are presented; in III we give the interpretation of the obtained results in terms of 
the neural network and the Ising model. Some proofs which are not significant for the understanding of the structure 
of the solution enter Appendix. _ _. 

We would like to mention that our approach is very close to the problem of generalization in neural networkgatil. 
Under the classical setting of the problem, one is investigating the network ability to reconstruct the set of memorized 
patterns if during the learning their distorted copies have been used. In contrast to the classical setting, in our case 
the distortions are deterministic ones. Moreover, the value of the distortion x is the same for all the memorized 
patterns and only one coordinate is distorted every time. In ^ it is shown that some of these restrictions can be 
omitted. However, the most important restriction of our approach is investigation of the generalization in the case of 
one memorized pattern only, but not the set of memorized patterns. Nevertheless, the meaningful interpretation of 
the obtained results (see III) is of interest for the more general case. 



Notations. We denote by s{k) the configuration vector which is coUinear to the bisectrix of the principle orthant 
of the space R"^. The vector, which after p distortions generates the set of the memorized patterns s^'', is called the 
standard-vector or the standard. Next, n is the number of the spin variables, p is the number of the memorized patterns 
and q = n — p is the number of the nondistorted coordinates of the standard. Configuration vectors are denoted by 
small Greek letters. We use small Latin letters to denote vectors whose coordinates are real. The n-dimensional vectors 
arc numerated by superscripts enclosed in brackets and their coordinates are numerated by subscripts. Matrices are 
denoted by half-bold capital Latin letters. 



II. MATHEMATICAL RESULTS 



Let us look for the local maxima among configuration vectors whose last coordinate is positive. 



A. Basic Model 

Since q last columns of the matrix S are the same, the configuration vector which is "under the suspicion" to provide 
an extremum is of the formn 

a* = ( cri,(T2 ,. . ■,q-p , 1,- ■■,1), (5) 

where we denote by-a' the p-dimensional part of the vector a*, which is formed by its first p coordinates. The direct 
calculations (or seeli3) show that 

F{(T*) oc x'^ — 2x{q +pcosw) cosw + {q + pcoaw)'^, (6) 

where 

cos^^ai^^ ,,.if'f» ,, (7) 

P II ct' II • II eb) II 

is the cosine of the angle between p-dimensional vectors a' and s(j)). Depending on the number of the negative 
coordinates of the vector a' , cosw takes the values 

2k 

coswk = 1 , fc = 0, l,...,p. 

P 

Consequently, 2^ "suspicious-looking" vectors a* are grouped into the p+1 classes Sj. and the functional F{a*) has 
the same value Fj^ for all the vectors from the same class: 

F{a*) = Fk W* e Sfe. 

The classes E^ are numerated by the number k of the negative coordinates which have the relevant vectors a* . The 
number of the vectors in the class S^ is equal to {^). 

To find the ground state under a given value of x, it is necessary to determine the greatest of the values 
Fq(x), Fi(x), . . . ,Fp{x). Under the comparison the common term x"^ can be omitted. Therefore, to find out how 
the ground state depends on the parameter x, it is necessary to examine the family of the straight lines 

Lk{x) = {q + pcoswk)^ — 2x{q + pcoswk) coswk- (8) 

In the region where the Lk{x) majorizes all the other straight lines, the ground state belongs to the class Sfc and is 
(^)-times degenerate. The analysis of the relative position of the straight lines Lk{x) is given in Appendix and leads 
to the following 
Theorem. As x varies from — cx) to cxd the ground state in consecutive order belongs to the classes 

Zjn, ^1 , . . . , ^k 

The rebuilding of the ground state from the class S/c_i into the class T,k occurs at the point Xk of intersection of the 
straight lines Lk-i{x) and Lk{x): 

Xk^^(l + ; ^- r)^p- "-(^fe-1) k = l,2,...,k^a.. (9) 

2\ q + p(coa Wk^i + COS Wk) J n + p — 2(2k — I) 

If ^3[ < h, one after another all the p rebuildings of the ground state take place according to the above scheme: 
kmax = P- And if ^^ > i, the last rebuilding is the one with the number kmax = [ ^ ] • The functional has no 
other local maxima. 

The Theorem allows to solve a lot of practical problems. For example, let n and p be fixed. Then, for any preassigned 
X it is sufficient to know between which Xk and Xk+i it finds itself to determine which of the classes Ej, provide the 
global maximum of the functional F(a), what is its value and the degeneration of the ground state. On the contrary, 
for the fixed x, it is possible to find out such n and p for which the ground state belongs to the preassigned class Sfc, 
etc. 



For details se 
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FIG. 1. The typical behavior of the straight hnes Lk{x), k — 0,1,..., p. The rebuildings of the ground state occurs at 
the points Xk of the intersection of the straight hnes Lfc_i and L^. Inside the interval {xk,Xk+i) the ground state belongs to 



the class E^. When x increases: a), all the rebuildings of the ground state occur (fc„ 
kmax = r "+^+ 1 = 3 rebuildings of the ground state occur, because ^^ > i. 



5), because 
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b). only 



The points Xk (9), where the rebuildings of the ground state take place will be called the rebuilding points. 

In Fig.l the typical examples of the relative position of the straight lines Lk{x) are presented for the cases ^^ < i 

(a) and ^^ > ^ (6). When x changes from — oo to xi, the ground state is the standard- vector e{n) (it exhausts the 
class So). In the point xi the ground state rebuilds from the class Sq to the class Si and becomes p-times degenerate. 
When X achieves the value X2, the ground state rebuilds from the class Si to the class S2 and becomes (2)-times 
degenerate, and so on. As x increases, at first the value of the functional for the ground state monotonically decreases 
and then, after reaching the minimum value, increases monotonically. For what follows let us note that if p < n — 1, 
then kmax > [^] and Xk^^^ > P- 

The case p — n worth to be specially mentioned. Here all the rebuilding points x^ stick to one point 



x' = Xk 



k^ 1,2,3,. 



(10) 



For any x from the left of x' the ground state is the standard- vector e{n) , and for x from the right of x' the ground 
state belongs to the class Srn+i, and is fr n+i ,) -times degenerate. 

The interval xi < x < Xk^^^ will be called the rebuilding region of the ground state. This region is examined in 
details in Appendix. Here we would like to mention only, that its left boundary xi > ^ and it is the monotonically 



increasing function of p as well as of n. And also, when p 
point 



const and n — > 00 the rebuilding region contracts to the 



(11) 



In this case for x < x" the ground state is the standard-vector and for x > x" the ground state belongs to the class 
Sp; again it is a nondegenerate one. 



B. Generalization of the Basic Model 



Let's examine some generalizations of the basic model, which seem to be the most important. From our point of 
view they significantly widen the class of possible connection matrices for which the effective solution of the problem 
(1) can be obtained. 



1. An arbitrary cofiguration standard-vector 

An arbitrary configuration vector 

a = (ai, . . . ,ap,ap+i, . . . ,a„) 

can be used in place of the standard e{n) , and its p distorted copies of the type (4) can be used as memorized patterns. 
These changes affect the elements of the connection matrix: 

J^f = JijaiUj, i, j = 1, 2, . . . , n. (12) 

If we introduce a diagonal {n x n)-matrix 

A = diag (ai, . . . , ctp, a^+i . . . , a„), 

then J^"-* = A • J ■ A and the functional we have to maximize, takes the form: 

F(") = (J(")a,a) = (J • Act, Act) - (jf, f) - ^(f). 

Here S, = Act, and the functional F{^) has been examined above. Consequently, all the results of the basic model 
remain correct, though here they are relative to configuration vectors 

CT* = (aiCTi, Q;2cr2, • • • , ttpCTp, Qfp+i, . . . , a„). 
Thus, e{n) can be used as the standard-vector without the loss of generality. 

2. Rotation of the memorized patterns 

An interesting generalization of the basic model is obtained when the memorized patterns (4) are subjected to 
rotation as a whole. Let the nontrivial part of the rotation matrix U = {uij) has the dimension p x p. It acts on 
the first p coordinates of n-dimensional vectors transforming the distorted part of the standard into a p-dimensional 
vector u: 



U2 



ui^^uu, 1^1,2,..., p, \\u\f^p. (13) 

i=l 



u — TJ ■ e{p) — 

\Up J 

As a result of the rotation the l-th memorized pattern takes the form 

(m1 - XUll,U2 - XU21, ...,Up- XUpl, 1, . . . , 1), (14) 

and it is convenient to choose the elements of the relevant connection matrix J'^'^' in the form 

Ji] = JijUiUj, 2, j = 1,2, . . . ,n. (15) 

From Eqs. (14), (15) it directly follows that if the standard- vector e{n) is not changed as a result of the rotation (in 
other words, ii ui = U2 = ■ ■ ■ = Up ~ 1) then the connection matrix J^'^' coincides with the matrix J. Consequently, 
all the results of the basic model are valid, though in this case the memorized patterns (14) are obtained from e{n) 
by simultaneous (!) distortion of its p coordinates. 

But if the rotation results in ui which are not all equal unity, the standard-vector shifts. Then it is convenient to 
introduce a diagonal (n x n)-matrix 

D = (wi,U2,.. .,Up,l,. ..,1) 

and present the connection matrix J'^'^' as a triple product J*^^' = D • J ■ D. By analogy with the above reasonings 
it is easy to show that again only the configuration vectors ct* (5) are "under the suspicion" to provide an extrcmum 



of the functional F^^\u) = {3^^^a, a). Again the value of the functional F''^\d*) is given by Eq.(6), but cos w has 
to be calculated with the help of other equation: 

cosw = ^ =^ ^ 16 

p II <^ II • II " II 

(compare with Eq.(7)). Thus, the value of the functional F^^'{(j*) is completely defined by the distortion x and the 
value of the cosine of the angle between a' and u. Consequently, again the vectors a* are grouped in the classes 
Yif, ' , inside which the value of the functional F'^'^^((T*) is constant. The vectors a* belong to the same class if their 
p-dimensional parts are equidistant from the vector u. The number of the classes Sj. is equal to the number of the 
different values of cos w (16). 

Thus, it is necessary to put in order 2^ vectors a' with regard to their proximity to the vector u. This problem will 
be discussed in another publication. But now let us assume that the desired ordering has been done and a decreasing 
sequence of the values of cos w (16) has been obtained: 

cos Wo > coswi > . . . > coswt. (17) 

Here the number of the classes E^ is equal to t + 1. We denote by Vk the number of the vectors a* in the fc-th class: 

VO + VI + ...+vt = 2P. 

Since the half of the vectors a' can be obtained from the other half of the full set of these vectors by changing signs 
of all their coordinates, for every k we have obvious equalities: 

coswk ^ -coswt-k, Vk^vt-k- 

Therefore, cosw^ is negative beginning from some number k. 

We see that as above to determine the dependence of the ground state on the parameter x, it is necessary to examine 
the family of the straight lines Lk{x) (8) where the cosines are calculated according to Eq.(16). The analysis of the 
placement of the straight lines gives the result which generalizes the Theorem (the proof is given in Appendix): 
As X varies from — cxd to cxd the ground state in consecutive order belongs to the classes 

The rebuilding of the ground state from the class 'S\.-i *'^^'' ^^^ class S^ occurs at the point Xk of intersection of the 
straight lines Lk-i{x) and Lk{x): 

a;fe = |(l + ^ ^^ r), fc=l,2,...,/c„,„ (18) 

z \ g + ]3(cosu'fe_i + cos Wfe) / 

where coswk belong to the ordered sequence (17). If xi > jp, one after another all the t rebuildings of the ground 
state take place according to the above scheme: kmax — t- And if xi < jp, the rebuildings of the ground state come to 
an end when the denominator in Eq.(18) becomes negative. 

Note 1. When the distortion x belongs to the interval {xk,Xk+i)i a-U the configuration vectors from the class Sj^, 
arc the local maxima of the functional i^('^) (x). At the same time the set of these vectors is the set of the fixed points 
of the Hopfield type network whose connection matrix is j'^' (we remind that its diagonal elements are equal zero). 

But the composition of each of the classes eJ^ is determined by the values of {ui}^^^ (13) only. And the choice of 
{''^'}f=i is completely in the researcher's hand. In other words, selecting {ui\^^^ and the distortion parameter x, the 
Hopfield type network with a preassigned set of fixed points can be created. 

Let's explain this statement with the help of a simple example. Let's create the network with the only fixed points 
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Then it is necessary, firstly, to group these vectors into one class Yi\, , and, secondly, to choose such distortion x 
which provides the belonging of the ground state to this class. The first step can be easily done, if with the help of a 
rotation of the memorized patterns the standard e{n) transforms into the vector 

M = (wi, ui, ui, -— , — , 3wi, UT,. .., Up), <ui < -— < 3ui < uy < . . . < Up. 



Indeed, it is evident that in this case the first six values of the cosines (16) and the relevant classes S^ ' are: 



cos Wo = ^j"^ — , So 

coswi = COS Wo — 2 • — , S^ ' 



cosu;2 = coswo — 3 • — , S2 



COS Ws = cos Wo — 4 • ^ , S3 
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C0SW5 = coswq — 6 • — , Eg ' 
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Thus, the class Eg ' consists of the given configuration vectors a, (3 and 7 exactly. Cosequently, if the distortion 
parameter x is chosen inside the interval (xsjXg), where X}. are defined by Eq.(18), the relevant network of the 
Hopfield type has the preassigned set of the fixed points. 

We would like to mention that an additional analysis is required to find out the limits of our method for the 
construction of networks with a preassigned set of the fixed points. 



3. Different distortions of the memorized patterns 

The distortions xi can differ for every of the memorized patterns (4): 

fl l,l-x,,l,...,l), / = l,2,...,p. 



^i) 



With the help of the reasoning the same as in II A , it is easy to show that as above only the configuration vectors 3* 
(5) are important. And the value of the functional F{a*) is 



F{d*) — 2_. xj — 2{q +pcosw) 2_. ^i^i +p{q +pcos w)^, 



1=1 



1=1 



where again cos w is given by Eq.(7). Under the comparison we have to omit the term X]f=i -^f • Now we obtain that 
to determine the ground state dependence on the set of the distortions {xi}i, it is necessary to examine the family of 
the linear functions 



L{a*,{xi}'l) = {q+pcoswf - 



2{q + pcosw) 
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■E 
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(JlXl. 



(19) 



For this analysis one can try to use a probabilistic approach, considering xi as independent realizations of a stochastic 
variable, which, let us say, is Gaussianly distributed. This way brings us nearer to the classical problem of the network 
ability to generalization (see Introduction). We have not investigate this approach. 

On the other hand, we can remain in the framework of the geometric approach, which is rather fruitful for this 
problem. Let's show this. Without loss of generality we assume that the distortions xi are arranged in increasing 
order. And for simplicity we assume them to be positive. Then, 



< Xi < X2 < ■ ■ ■ < Xp. 



(20) 



As in II A we have 



2k , , 

coswfe = 1 ^ q + pcoswk = n — 2k, fc = 0,l,...,p, 

P 

and the family of the functions (19) decomposes into p + 1 subfamilies Lf^(a* , {xi\\). Here the subscript k indicates 
that the relevant configuration vector a* belongs to the class S^: 

Lt,{a\{xi]\) ^{n- 2kf - ^^" ' ^^^ V a^x,, a* G S^, k^Q,l,...,p. (21) 

The decomposition of the family of the functions (19) into p + 1 subfamilies (21) allows to find easily the ground 
state for a given set of the distortions {x{\\. Indeed, to determine the ground state we have, firstly, to find the vector 
a*(fc) inside every class Sfc, which provides maximum L*^. of the function Lk{(y* , {x;}^): 

Ll^Lk{a*{k),{xi}{)>Lk{a*,{xi}{) Va* G S^, k^O,l,...,p. 

And, secondly, we have to determine the greatest among p + 1 values L^,. The last can be done easily, and the 
determination of the vectors (J*{k) does not require additional calculations at all: if n — 2fc > 0, we have to minimize 
the factor X]f=i '^i^i (^^e Eq.(21)); if n — 2fc < the same factor has to be maximized. From inequalities (20) it is 
evident that the factor is minimal at the vector 



a*(fc) = (l,l...,l,-l,-l,...,~l,l,l,...,l), 

^ V ' 

fe 
and it is maximal at the vector 



a*(fc) = (-l,-l...,-l,l,l,...,l,l,l,...,l). 
^ ^ ^ 

k 

Thus, when the distortions xi are different, the geometric approach allows to determine the ground state very quickly 
and efficiently. It seems, that the potentiality of the geometric approach does not exhausted by the mentioned 
examples. 

Note 2. We denote by x the average over all the distortions 

V 

We write the distortions xi in the form: 

xi^x + di, / = 1, . . . ,p, 
where di = xi — x \& the deviation of the ^-th distortion from the mean value x. Now Eq.(19) can be rewritten as 

L{(j* , {xi}\) = {q + pcoaw)^ — 2x{q ~\- pcosw)cos'w TJo-jd;. (22) 

P 1=1 

Let's discuss the case of slightly different distortions. That is, the deviations of all the distortions from the mean 
value X are sufficiently small: 

|dj|<<l, 1 = 1,..., p. 
Then it is clear that the first two terms give the main contribution in Eq.(22). We would like to remind that just these 



two terms were analyzed in [I A (see Eq.(8)). Consequently, in the zeroth approximation we have, that if x is inside 
the interval [xk, Xk+i] (xk are defined by Eqs.(9)), the ground state belongs to the class S^, and the global maximum 
of the functional is (^)-times degenerate. What will happen when the third term in Eq.(22) is taken into account? It 
can be expected that for sufficiently small di, this term does not lead the ground state out of the class Sfc, but only 
lifts of the ground state degeneracy. Numerical experiments confirm this assumption: when all di are not very great 
and X G {xk,Xk+i), all the local maxima of the functional (1) belong to the class Sfc, the values of the functional at 
vectors a* G E^ differ slightly and one (or several) of these vectors provide(s) the global maximum of the functional. 
Thus, we explain one of the possible reasons, why the functional (1) has a lot of slightly different local maxima. One 
frequently faces such situation in the neural network theory, Ising model and the optimization problems. 



4- Other generalizations of the basic model 

Without dwelling on the details, we would like to mention other possible generalizations of the basic modeln. 

1°. The normalized memorized patterns. 

The memorized patterns (4) can be normalized to unit to prevent their length being dependent on the varying 
parameter x. It is easy to see, that in this case all the reasonings, which lead to the Theorem, are valid. It can be 
shown that here the maximum value of the functional for the ground state decreases monotonically as function of x. 

2°. The identical distortion of a group of coordinates of the standard. 

Let m coordinates of the standard s{n) be distorted simultaneously and identically. Then [p x r7,)-matrix of the 
memorized patterns has the form 
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Of course, now n = p x m + q. Let's note that the nondiagonal elements of the relevant connection matrix j'™) have 
the terms, which are quadratic in distortions. 

With regard to the arguments of II A, it is not difficult to show that the maximum of the functional F^"^^ = 
(j(™) . a, a) is achieved only at "piecewise constant" configuration vectors 



(o-i,- 



■ ,0-1,0-2, 



, (72, . . . ,0-p. 



, (Tp , i , . 



!)• 



The form of these vectors is dictated by the memorized patterns matrix S*-™-*. The value of the functional i^(™) at 
the vectors <j* is 



i^("')(a*)oca;2 



2x cos w { h p cos w 



p cos w 



where w is the angle between the p-dimensional vectors a' = (cri, 0-2, . . . , o-p) and e(p), and cos w is given by Eq.(7). 



As above, 2^ vectors a* are distributed among the classes Sj," at which the value of the functional i^^"*) is constant: 



i^("')(a*) = i^^"'^ W*eT. 



^(m) 



0,1,. 



^(m) 



The structure of the classes Ei. is much alike the structure of the classes Efc from II A. The only difference is that 



here the value —1 have not one of the coordinates ai of the vector a* , but a whole group of m such coordinates. 



Again, as above, the number of the vectors a* in the class SJ^™ is equal to (^). The same reasoning as in II A leads 
to the generalization of the Theorem: 



(m) 



^(m) 



The rebuildings of the ground state from the class S^j^ into the class Ej, occur at the points xj^ 



A-^) 



S™-) 



q + mp{cos Wfc_i + cos Wk) 



P- 



+ p-2{2k- 1)' 



■ , f^max- 



U P < j^ — ^ 1, one after another all the p rebuildings of the ground state take place according to the above scheme: 



p. And if p > -^ — h 1, the last rebuilding is the one whose number is k„ 



■^max — f ^^'-^ "J f ^ 2m 

no other local maxima. 



^+P+2 



The functional has 



For what follows we would like to note, that when m increases the rebuilding points x^ with k < ^^ move to 



the left, and the rebuilding points x^ with k > ^-^ move to the right, lip is an odd number, necessarily there is 



the rebuilding point with the number k = ^^: Xp±i = p. When m increases this point does not move. 



'See also Note 3 of III B 



III. DISCUSSION AND INTERPRETATIONS 

Let us discuss the results which mainly refer to the basic model. 

A. Neural networks 

In connection with neural networks the Theorem has to be interpreted in the framework of the meaningful setting 
of the problem, which has been given in Introduction. Then the Theorem means: The quality of "the truth" (i.e., 
the standard e{n)) which is understood by the network depends on the distortion value x during the learning stage 
and on the length p of the learning sequence. We would like to point out some important details of the suggested 
interpretation. 

1°. In agreement with the common sense the error of the network increases with the increase of the distortion x: 
when X belongs to the interval (xk,Xk+i), "the truth" which is understood by the network (the class Sfe) differs from 
the standard e(n) by k coordinates. 

Also it is quite reasonable that the left boundary of the rebuilding region xi is the increasing function of p and 
n. Indeed, when n and x are fixed, merely due to increase of the number of the memorized patterns p the value of 
xi can be forced to exceed x (of course, if x is not too large). As a result x turns out to be on the left of xi, i.e. 
in the region where the only fixed point is the standard e(n) . In other words, only by an increase of the length p of 
the learning sequence, one can attain that the network understands correctly "the truth" it is tried to be learned. 
This conclusion is in agreement with the practical experience according which the greater the length of the learning 
sequence, the better the signal can be read through noise. From this point of view the most reliable network of the 
considered type is the one with p — n: for any distortions x < x' = "^ it steadily reproduces the standard-vector (see 
the comment to Eq.(lO)). 

In the same way when p and x are fixed, merely due to an increase of the number n the value of xi can be forced 
to exceed x. This result is reasonable too: when p is fixed and the number n increases, the relative number p/n of 
the distorted coordinates of the standard-vector decreases. Naturally, the less the relative weight of the distortion, 
the better must be the result of the learning. 

2°. Under an increase of the distortion parameter not only the error of the network increase, but also the number 
of "noncorrect truths" which are understood by the network: when x S (xfc, Xk+i) the number of the network fixed 
points is (?). Such multiple degeneracy of the "truth" is in agreement with the common idea that in general there is 
only one "real truth", but the number of the "noncorrect truths" is large. And also, the greater is the distortions at 
the learning stage, the more can be the number of "noncorrect truths". 

However, when x becomes greater than Xp+i = p, the number of "noncorrect truths" decreases. And this number 

2 

decreases monotonically under the further increase of the distortion parameter. Moreover, according to the Theorem, 
when ^5i < |> for ^^ ^ whose values exceed the right boundary of the rebuilding region, the ground state belongs to 
the class Sp. And, in spite of the very great distortion, the ground state is a nondegenerate one! In other words, the 
number of "noncorrect truths" is a nonmonotone function of the distortion x. 

This result seems to be a paradox. This paradox can be explained when it is related to the well-known feature 
of our perception: we interpret deviations in the image of a standard as permissible ones only till some threshold. 
If only this threshold is exceeded, the distorted patterns are interpreted as quite different standard. The Hopfield 
network under consideration demonstrate just the same behavior with Xp+i = p being the boundary of the permissible 

2 

distortions of the standard. Let's prove the last statement. 

Here the notation for the standard e{n) differs from the one used above, and it is 

£<+) = (l,l,...,l,l,...,l). (23) 

^^ V ' 

p 
We also introduce another standard: 

e<-) = (-l,-l,...,-l,l,...,l). (24) 

^ ^ ^ 

p 

In their not coincident parts the standards e^^' and e^^' are opposed with each other, i.e. they are two opposite 
statements. Any of the network fixed points a* (5) is an intermediate statement between e^+-' and e^^', which is 
drawn towards either one edge of the scale (e^^'), or the other (e*-"-*). 



10 



Which distortions of the standard e^+^ have to be assumed as permissible? Evidently, those for which the result of 
the learning resembles the standard e^^' more, than its opposition e^~K This is, the fixed points a* have to differ from 
the standard £*-+) not more than by | first coordinates: ct* G Sfe, fc < f ■ ■^^'^ ^^^^ takes place only if a; < Xp+i = p. 
But if a; > p, the fixed points a* belong to the class Sfc with k > ^, and they more resemble the standard er-^', than 

Let's point out the following curious circumstance. Let not the one, but a whole group m of the coordinates of 
the standard £^+^ be distorted. Then the rebui lding p oints xj!^ with the numbers k < ^i— move to the left when m 



increases (see the last paragraph of item 2° in [IB 4). And this is clear: now the total distortion of the standard is 
?7i-times enlarged and the network earlier ceases to extract the standard from the memorized patterns. It is somewhat 
unexpected that the boundary x^i = p oi the permissible distortions of the standard does not depend on m. It might 

appear that when the total distortion is TTi-times enlarged, the boundary of the permissible distortions is reached m- 
times quicker. However, this is not the case. Apparently, the reason is nonadditivity of the influence of the distortion 
X on the result of the learning. Out of the simultaneous distortion of m coordinates of the standard, it does not follow 
at all that xj™ becomes ?n-times less than Xk- When m increases, the rebuilding point Xi undergoes the most shift 
to the left. The shift of Xj is somewhat smaller, the shift of x^ is still smaller, and so on. The rebuilding points 
^k '^ith the numbers k ~ p/2 shift only negligible. It might be assumed that when the distortion x is so big that 
the result of the learning differs from the standard significantly (it is almost impossible to understand the standard) , 
it is not so important one or several parameters are distorted. Although formally just Xp+i = p is the boundary of 

2 

the permissible distortions. 

In our opinion, the most interesting conclusion here is that the Hopfield type network understands the distortions 
as permissible only up to some threshold. That is, a nontrivial feature of the human perception is inherent in the 
artificial network of the examined type. 

3°. When the distortion x is not very large the memorized patterns s^'-* are interpreted by the network as the 
distorted copies of the standard e^"*"^ . But if during the learning stage the distortion x is very large (it exceeds p) , the 
network ceases to understand the memorized patterns (4) as distortions of the standard e^+^ . It does not mean that 
in this case the memorized patterns (4) are understood as distortions of the opposite standard e^': the permissible 
distortions of e^"^ are the vectors 



(-l,...,-l,-l + y,-l,...,-l,l,...,l), 1 = 1, 2,. ..,p (25) 

I 

where y is not so large. The question is, what do the memorized patterns (4) with large distortions x mean? We think, 
that the increase of x above the the threshold p can be understood as the more and more negation of the standard 
e^^' . As if the network is learned by the memorized patterns, which denies the standard e^+-'. In other words, the 
network is relearned by presentation of negative examples. 

There is big and clear to everybody difference between the relearning with the help of negative examples and 
the learning of the opposite truth. In the framework of our model in the last case slightly distorted n-dimensional 
vectors of the type (25) ought to be used as memorized patterns. Then, with no problem the network understands 
the standard £^~\ The relearning is entirely other case. It is well known: the better the incorrect truth has been 
understood, the more difficult (and sometimes even impossible) to correct it; it is comparatively easy to correct the 
result slightly, but it is much more difficult to revise it in the main, etc. 

We think, that the dependence of kmax on p (see the Theorem) is the reflection of just these problems. When the 
number p of the parameters which have to be corrected is not very great (^Ey "^3)' the network can be relearned by 

simple presentation of negative examples. In this case kmax = P and, when "the denial" of the standard e^^' is rather 
strong (x > Xp), as "a new" truth the network understands the opposite standard £^>. But if the number of the 
corrected parameters is great {■^^^ > 3), to relearn the network it is not sufficient to present the negative examples. 
In this case [^^] < k^ax < P and whatever large x is {xk^^-, < x < 00), as a new truth the network understands not 
the opposite standard £^^\ but one of the statements intermediate between e^+^ and s^~\ Though the understood 
truth is drawn towards s^~\ since kmax > f ■ 

Of course, our interpretation is open for discussion. But it seems that in real life there are a lot of examples, which 
confirm our conceptionQ 



*In this connection we are tempted to compare the development of the reforms ("the relearning") in Russia and in China. In 
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B. The Ising model 

The interpretation of the Ising model in terms of the matrix S is not known yet. Therefore here the obtained resuhs 
are interpreted starting from the form of the Hamihonian J (3.b). Let's write it in the form 



J OC I „T 



A B 

BT C 



where the diagonal elements of {p x |5)-matrix A and (q x q)-matrix C are equal zero, and 

aij = l-2y, i,j ^l,2,...,p, ij^j; 

hk^l-y, i = 1, 2, . . . ,p, fc = 1, 2, . . . , g; 

Cfci = 1, fc,Z = l,2,...,g, fc^Z; *- '' 

The matrix J corresponds to a spin system with the infinitely large interaction radius. The system consists of two 
subsystems, which are homogeneous with respect to the spin interaction. The interaction between the p spins of the 
first subsystem is equal to 1 — 2y; the interaction between the q spins of the second subsystem is equal to 1; the 
crossinteraction between the spins of the subsystems is equal to 1 — y. When p = n all the spins are interacting with 
each other in the same way and we have the usual mean-field approximation for the Ising model at zero temperature. 

While y < 2, all the spins are interacting in the ferromagnetic way; when ^ < y, the interaction between the spins 
of the first subsystem becomes of antiferromagnetic type, and when 1 < y the crossinteraction is of antiferromagnetic 
type too. The Theorem allows to trace how the ground state depends on the variation of the parameter y. 

Let p < n. For y G (— c», |) the ground state is the ferromagnetic one since ^ < j/i = — , and for x < xi the 

ground state is the standard- vector £^+' (23). It is interesting that the ground state remains ferromagnetic even if 
2 < y < yi, i.e. when the antiferromagnetic interactions already are shown up in the system. In other words, when 
p < n, there is "a gap" between the value of y corresponding to the destruction of the ferromagnetic interaction, and 
the value of y corresponding to the destruction of the ferromagnetic ground state. Only after a "sufficient amount" 
of the antiferromagnetic interaction is accumulated, the first rebuilding occurs and the ground state ceases to be the 
ferromagnetic one. Then, when another critical "portion" of the antiferromagnetic interaction is accumulated, the 
next rebuilding of the ground state occurs (it happens when y exceeds y2 — — ), and so on. After the parameter y 

reaches the value y" — 1 ~ ^^ , the crossinteraction becomes antiferromagnetic too. But the rebuildings of the ground 



state go on, since usually x^^^^ > p (see the comment to the Theorem in II A). 

The energy E = — F of the ground state as a function of the parameter y has breaks at the points yt — —■ It 
increases till y < 1 and decreases when y > I. However, if the memorized patt erns ar e normalized to unit, the energy 



of the ground state is a monotonically increasing function of y (see item 1° in II B 4). 

It is natural to treat p — const, n — > cxd as an infinitely large sample with a few number of impurities. In this case 
all j/fc stick to the point y" — 1 (see the comment to Eq.(ll)). Depending only on the type of the crossinteraction 
between the impurities and the rest of the sample, the ground state is either the ferromagnetic one, or the spins of 
the impurities are opposite with respect to all other spins. 



Russia very great number of the parameters of the social order were simultaneously changed on their opposition. They are a 
very wide group of economic regulations, a group of parameters related to home and foreign policy, the parameters related to 
human rights and to the destruction of "the iron curtain" and so on. On the contrary, according to massmedia, in China only 
a restricted group of economic regulations has been changed up to now. 

And what are the results? Of course, now Russia is not a semi-fascist Soviet State, but it does not reach (until now?) the 
stable State system of the modern democratic type. Already for a long time the situation in Russia is characterized as a state, 
which is intermediate between these two oppositions. The permanent instability and variations of the internal situation in 
Russia are associated with "the degeneracy of its present ground state". In the same time (again, if the massmedia information 
is trustworthy), China demonstrates a steady increase of well-being of the people. And this stipulates the direction of its 
development. 

Being not a specialist, I have no right to intrude into such a special field of knowledge as politology. But in accordance with 
our results, it appears that, may be, "a step by step" strategy of reforms is more prospective. That is, the result will be better 
if at first one group of the parameters is corrected reliably, then the second group, then the third and so on. According to the 
Theorem, taken separately each group of the parameters can be corrected reliably (of course, if the Theorem is applicable in 
this case). On the other hand, the final result of "the relearning" of these two colossi is unknown yet... 
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Finally, let's discuss the case p = n. Then all yk stick to the point y' = | (see the comment to Eq.(lO)). Here the 
destruction of the ferromagnetic interaction occurs simultaneously with the change of the ground state ( "the gap" 
disappears). As long as the interaction of the spins is ferromagnetic {y < ^), the ground state is ferromagnetic too. 
But when the interaction of the spins becomes antiferromagnetic (y > ^), the ground state turns out to be ( ,i"i ) -times 

degenerate. From the right of i^ it is natural to associate the state of the system with the spin glass phase. 

Note 3. We would like to remind that the Hamiltonian can be generalized (see Eqs.(12) and (15)). Nevertheless, 
at heart of all these generalizations is the strict relation between the elements of the submatrices A and B of the 
Hamiltonian: 

1 + a = 2&, 

see Eqs.(26). This restriction on the matrix elements is justified by nothing. It would be interesting to examine the 
case when the matrix elements a and b do not related in any way. We have succeeded in the generalization of our 
consideration on this case as well as on the case when the linear term h X]i=i '^i has been added to the functional 
F{(f) (1). In the Ising model due to such a term the external magnetic field can be taken into account. Now we 
prepare these results for publication. 

C. Factor analysis 

In the framework of factor analysis a (p x n)-matrix S with real matrix elements (see Eq.(2)) plays an important 
role. Here the matrix S is an empirical matrix of the 'Ipbjects-parameters" which includes the exhausting quantitative 
description of the phenomenon under consider ationH. Its rows are p objects defined by n parameters which are 
available for the measurement: 



.^o^r..w_..w 



„(0 



(s^ ,S2 , . . . ,s„ ) G R'^, l — l,2,...,p - the vector-objects. 



/ 4'^ \ 

(2 



V s'r^ I 



G RP, i = 1, 2, . . . , n - the vector-parameters. 



The main idea of the factor analysis is as follows: by proceeding from n p-dimensional vector-parameters we have to 
introduce such t artificial characteristics, the so called factors, that their number is much less than n (i <C rt), but 
the description of the objects in terms of these factors does not lead to the loss of an important information about 
the objects. The factors fj are constructed as linear combinations of the vector-parameters si. 



/.- 






which in the sensejof a measure of proximity minimize the total deviation of each factor from all the vector-parameters. 
It has been shownEI, that for the so called centroid method, which is widespread in factor analysis, the problem of the 
construction of the first factor is equivalent to the maximization of the functional (1) with the connection matrix of 
the Hebb type. The interpretation of the obtained results in terms of factor analysis will be presented elsewhere. 
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A. The family of the straight lines Lj, (x) 

The family of the straight hnes (8), 

Lk{x) = (n - 2kf - 2x- [n - 2fc), /c = 0, 1, . . . ,p, 

P 

is under investigation. When x := we have Lfc(O) — {n ~ 2fc)^. Then if p < n/2 the values Lfc(O) decrease 
nionotonically when k increases: 

Lo(0)>Li(0)>L2(0)>...>ip(0). 

And if p > ri/2, beginning from the number n — p the values Lk{0) become twice degenerate: 

Lo(0) > Li(0) > . . . > L„_p(0) = Lp{0) > L„_p+i(0) = ip-i(O) > . . . 

Depending on the number k the straight lines Lk{x) behave themselves in different ways: for k < ^ and k > ^ the 
slope of Lk{x) is negative, and for | < fc < ^ it is positive. The straight lines with the numbers k — ^ and fc' = f (if 
such k and k' are found) are parallel to the axis OX. 

We denote by X(^k.k') the abscissa of the point of intersection of the straight lines Lk{x) and Lk'{x): 



P 

':{k,k') = 2 



q + p{cos Wk + cos Wk' ) 



''-^^' + ^) 0<k<k'<p. (27) 



n + p-2{k' + k) 



We would like to point out that the abscissa x^j^j^i) depends on the sum k + k' only. When n + p is odd, the 
number of different abscissae x/f^j^i\ is exactly 2p — 1. When n +p is even, there can be such numbers k' and k, that 
n + p — 2(fc' + fc) = 0. The relevant straight lines Lk{x) and Lk'{x) are parallel (they intersect at the infinity). 
Generally, the abscissa X(^k,k') increases when the sum k + k' increases 

X{as) < 2^(0,2) < a:(o,3) == a;(i^2) < a:(o,4) = a;(i^3) < . . . 

But the increase is strictly nionotonic only for 

n-1 1 

< -. 

n- 1 - 3 

In this case the denominator of the expression for x/f^j^n is nonnegativc for all k and A:', and all the intersections of 
the straight lines take place in the region x > 0. But if 

p- 1 1 
> 



n~l 3 

the abscissae X(k^k') increases monotonically up to the value k + k' — ^^^-^ where it has a discontinuity and becomes 
negative, and after that X(^k,k') again increases monotonically with increase of fc + k' . It is easy to verify that for 

■^ < -^^^r ^ 2 ^^^ ^^^ abscissae X(^k,k') with k + k' > ^^^-^ remain negative. But if -^^^r > 2 ' ^^ least x^p^ip-f appears in 
the region a; > 0. Nevertheless, in this case also x/p_ip\ remains less than a;(o,i)- In Fig. 2 the dependence of x/f^j^n 
on k + k' is shown both for ^^ less and greater than ^ . 

B. The proof of the Theorem 

We divide the proof into two steps. 

1°. Let's observe how the ground state changes as x increases. For a; = the ground state is the one with fc = 
(see above the inequalities for Lfc(O)), and this state is nondegenerate. As x increases, the state with fc = remains 
the ground state, till the straight line Lq{x) does not intersect any other straight line Lk'- From Eq.(27) it is evident, 
that the first intersection happens with the straight line Li{x) at the point 

n — 1 

XI =a;(o,i) =P — \ ^• 

^ ' 71+ p — 2 
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FIG. 2. The dependence of the abscissa x^i^^i^i^ of the intersection of the straight hnes Lkix) and Ly{x) on k + k' 
character is defined by the value of the parameter -^^ : a) . for -^^^ < 



Its 



3 the curve X(^t,k') increases monotonically, remaining all 
the time in the region a; > 0; b). for i < ^^ the monotonically increasing curve X(k,k') has discontinuity when k + k' = ^^-^, 
after that again it increases monotonically. 



After this the ground state is the state with k ^ I from the class Ei. This state is p-times degenerate. 

As X increases further, the state with fc = 1 remains the ground state till the straight line Li{x) does not intersect 
any other straight line Lk>{x). From Eq.(27) it is evident, that the first intersection takes place with the straight line 
L2{x) at the point 



X2 =a;(i,2) =P 



n — 3 
n + p — 6 



After this the ground state is the state with k = 2 from the class E2- And this state is (2) -times degenerate. 

B y ana logy with the above reasoning and taking into account the behavior of the family of the straight lines Lk{x) 
(see V A ) , we obtain that when x increases the rebuildings of the ground state happen at the points 



Xk = X(^k-l,k) 



q 



q + p{coswk-i +coswk) 



{2k - 1) 



p-2{2k- 1) 



fc= 1,2,3,. 



is replaced by the 



At the point of the k-th rebuilding the (^^j^j-times degenerate ground state from the class Efc 
(^) -times degenerate ground state from the class Efc. 

The rebuildings of the ground state take place till the increase of x is accompanied by the intersections of the 
straight lines Lk-i{x) and Lk{x). In other words, till 2A; — 1 is less than ^^. Since for ^5t — i Gven 2p—l< ^^, in 
this case as x increases one after the other all the p possible rebuildings of the ground state take place. Consequently, 
for all X > Xp = a;(p_ip), the ground state remains in the class Ep. (Strictly speaking, if ^^i = 5) the last rebuilding 
of the ground state happens at the infinity.) 

But if ^^ > i the rebuildings of the ground state come to an end when 2fc — 1 becomes more than ^^4^. For 

n—l 3' ^ ^ 2 

these k the abscissa X(^k-i.k) of the intersection of the straight lines L^-i and Lk is negative, and under the further 
increase of x there is no new intersection of the straight line Lk-i with any other straight line Lk'- In other words, 
the state with the number k — 1 remains the ground state under the unlimited increase of x. In this case the number 



of the last rebuilding is defined from the equation 



^fhri 



-1 = 



P 



h — 



n+p + 2 



(28) 



r n+p+2 1 



Thus, in the region x > the last rebuilding of the ground state takes place when x = Xk^^^ . (If ""'"^^^ = [- — ^ — j . 
the last rebuilding takes place at the infinity.) For negative x there are no rebuildings of the ground state, since in 
this region the straight line ^0(2;) majorizes all other straight lines. 
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2°. To complete the proof, let's show that the functional has no other local maxima. The configuration vector, 
which provides a local maximum, has the form (5) necessarily. Let for some x this vector belong to the class E^. 
Then the inequalities 

Lk(x) > Lk-i{x),Lk+i{x) 

must be fulfilled. Removing the brackets and collecting the similar terms we obtain: 

x[2±{n+p-4:k)]>p[l±{n-2k)]. (29) 

A). Let n + p - Ak > 2. It means that fc + 1 < "+^+^ , or, in other words, that k e [1, k,nax - !]• The sign "+" in 
Eq.(29) results in the inequality 

n- (2k -1) 

X > p -, r = Xk, 

-^n + p-2{2k-l) 

and the sign "~" results in the inequality 

n- (2k + 1) 

X < p ■ = Xk+l. 

n + p — 2(2k + 1) 

Thus, for fc G [1, fcmaa; — 1] the state with the number fc is the local maximum only if a; belongs to the interval [xk^ a^fc+i]j 
where this state is the global maximum. 

B). Let now —2 < n + p — 4fc < 2. It means that fc > kmax — 1 and fc < kmax', in other words, that fc — kmax- The 
sign "+" in Eq.(29) results in the inequality x > Xk^^^, and the sign "-" results in the inequality x > Xk^^^+i. But 



we know that the value of Xk,„^^+i is already negative (see above V A). Consequently, the simultaneous fulfillment of 
the last two inequalities takes place in the region x > Xk^^^ . And this means that the state with the number kmax is 
the local maximum of the functional only if it is its global maximum. 

C). And, finally, let n+p — Ak < —2; in other words, let fc > kmax- The sign "+" in Eq. (29) results in the inequality 



X < Xk, and the sign "-" results in the inequality x > Xk+i- But as it has been shown in VA, even for fc > kmax 
we have the ordering Xk < Xk+i- Consequently, for fc > kmax the simultaneous fulfillment of inequalities (29) is 
impossible. This completes the proof. 

C. The region of the ground state rebuilding 

The region where the parameter x changes from xi to Xk„^^^ , will be called the rebuilding region of the ground state. 
Let's examine this region. 

The left boundary of the rebuilding region is 

n — 1 p 

Xi=P ■ t; > o> 

n + p — 2 2 

where the equality takes place only for p = n. It is easy to verify that xi is a monotone increasing function of both p 
and n. These statements are important for the meaningful interpretation of the obtained results. 

The expression for the right boundary of the rebuilding region depends on the relation between p and n. If ^^ < i, 
then kmax — P and 

_ _ n+l~2p 

-^fcmax — Xp — P ■ — — . 

n + 2 — 3p 

As function of n the value of Xp decreases. Consequently, for n — > oo and fixed p the rebuilding region contracts to 
the point x" = p. And if ^^f > | , than 

n ~ p + 4:K 

where 4k = 1, 2 or 3, and kmax is given by Eq.(28). When n is fixed and p —^ n, the rebuilding region also contracts 
to the point. It is not difficult to see that this point is a;' = ^. 



In many respects the point x" = p is remarkable (see III). It is easy to see that before x becomes equal to x" — p, 
exactly ^^ rebuildings of the ground state take place. This result does not depend on n (if only p is less than n — 1). 



16 



D. The rotation of the memorized patterns and kmax 

Let's prove the generaliza tion o f the Theorem for the case, when all the memorized patterns (4) are subjected to 
the rotation as a whole (see [IB 2). For simplicity we assume that the number i + 1 of different values of the cosines 
(17) is even: 



t + l = 2r. 

Let's agree the subscript k to take the values 1,2, . . . ,r — ^^ only. 

Since for every k we have coswk = — cos Wj+i_fe, then coswr-i — — coswr and the ordered sequence of the cosines 
has the form: 

cos Wo > COSWi > . . . > COSWr-l > > COS Wr > COSWr+1 > • • ■ > C0SU'2r-l- 

It is not difhcult to prove that the rebuilding of the ground state from the class S^ _-^ into the class E^ happens at 
the point Xj (18). In what follows we'll prove that if xi > jp, one after another all the t rebuildings of the ground 
state take place according to the above scheme: kmax — t. And if xi < jp, the rebuildings of the ground state come to 
an end when the denominator in Eq.(18) becomes negative. 

The numbers k and t + 1 — fc are symmetric with regard to the ends of the sequence of the rebuilding points. For 
every rebuilding point 



P 



1 



9 



q + p{coswk-i + cosuifc) 
the point xt+i-^ is called the dual rebuilding point. The expression for xt+i-k 



Xt+l^k 



P 



1 



q 



2 V q — p{coswk-i + coswk) J ' 
differs from the expression for Xk by the sign in the denominator only. The rebuilding point 



P 
2 



1 



q + p(cOS Wr~l + COS Wr) 



P 
2 



1+'- 

q. 



p 



is dual to itself. We begin the proof of the general statement with the proof of the following 
Lemma. 

1. The inequality Xk > jP is equivalent to the inequality q — p{coswk-i + coswfe) > 0. 

2. For the value of Xt+i-k to be negative, it is necessary and sufficient to have -^p < Xk < |p- 
The proof of the first statement is obtained from the convertible sequence of the inequalities 



Xk> ^P 



q + p(c0S Wk-l + cos Wk) 

Let's prove the second statement. Let 



> 



1 



P 

xt+i-k - 2 ■ 



1 



q — p(cos Wk-i + cos Wk) > 0. 



<0. 



q — p{coswk-i + coswk) 
Then, firstly, q — p{coswk-i + cos Wk) < 0, and it is equivalent to the inequality Xk < jP- And, secondly: 

^ ^— T<-1' 

q — p(cos Wk-i + cos Wk) 

from where we directly obtain q + p{coswk-i + cos Wk) < Sq, and, consequently, 

P 

Xk = -z 



q + p{cos Wk- 1 + cos Wk ) 



>^. 


h + -l 


2 


[ 3q\ 



Assume the converse. Let ^p < Xk < jP- This means that 



-q < q+ p(cos wt-k + cos Wt-k+i ) < 0, 
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from where it follows that 



Xt+i-k - 2 



q 



'P[COSWt-k +COSWt-k+lj 



<!■ 


1- 


g 
q. 



0, 



i.e. that the abscissa zt+i-fc is negative. This concludes the proof of the Lemma. 

Generally, the first rebuilding point xi always exceeds |. The proof of the main statement is in three steps. 
Step 1. Let's examine the case xi > jp. Then accordingto the first conclusion of the Lemma (7— p(coswo+cosit;i) > 
and therefore a similar inequality is true for any other k: q—p{cos Wk-i +cos Wk) > 0. Consequently, all the rebuilding 
points Xj (18) are positive. And this is just the case when in consecutive order one after another all t possible 
rebuildings of the ground state take place. It is interesting that this case corresponds to very simple requirement to 
the first rebuilding point: xi > |p. 



Step 2. Now we investigate the case ^p < xi < jp. 



2o^ ^ ^. ^ 3„ According to the second statement of the Lemma, here the 
rebuilding point Xt, which is dual to xi, is negative: Xt < 0. Then we pass from Xi to X2: xi < X2. If X2 is still 
less than |p, its dual rebuilding point xt-i is negative too. Then we must to pass from X2 to X3: if X3 is also less 



than |p, its dual rebuilding point Xt-2 is also negative, and so on. Since at least Xr 



the number k, for which Xk exceeds jp: Xk > jP- 
already positive: 



P > jP, 



sooner or later we get 



Then according to the Lemma its dual rebuilding point xt^ 



l-k IS 



Xt+l-k 



P 
2 



q — p{cOSWk-l +COSWk) 



>0. 



It is evident that xt+i^k is not only positive, but exceeds xi: 

Xt+l-k — Xi ~ cos Wo + cos Wi + COS Wk^l + COS Wk > 0. 

Thus, the last rebuilding of the ground state happens at the point with the number t + 1 — k. The next rebuilding 
point, xt-k+2, is already negative. In other words, kmax is exactly equal to j = t + 1 — k, where 

q + p{coawj^i + cos Wj) > 0, but q +p{coswj + coswj+i) < 0. 

And just these inequalities give the statement we have to prove. 

Step 3. Finally, we have to examine the case xi < |p. This inequality is equivalent to q — p{coswq + coswi) < —q, 

from which it follows that Xf > 0: 



Xt 



P 
2 



1 



q + p{cos Wo + cos wi ) I 



>l 


1- 


_ 9" 
q. 



0. 



In the same time xt < Xi: 



Xi — Xt 



q + p(cos wq + cos Wi) 



p(coswo + cosiui) 



>0. 



We pass from xi to X2- If X2 is also less than |p, its dual rebuilding point Xf_i is also positive (but this time it does 
not exceed x*). Increasing the number k we sooner or later get the rebuilding point Xk which exceeds |p. And then 
we find ourselves under the conditions discussed in Step 2: |p < Xfc < jp. Here the point Xt+i_fc, which is dual to 
Xk, is negative. At the next steps we get such a number k, that Xk exceeds jp. According to the above reasoning this 
means that k^ax — t + 1 — k. In other words, in the case xi < |p, at first Xj increase up to the maximal positive 
value, then become negative, and after that they increase. Beginning with a certain number j, the values of Xj are 
positive again, but less than xi. This concludes the proof. 

Thus, the rebuilding points Xj, j = l,2,...,t behave themselves in different ways depending on the interval to 
which the first point Xi belongs: 



xi > -^P, -^P<xi < -p, xi < -p. 



(30) 



The analogous result has been also obtained above, when in V A the family of the straight lines Lk{x) was investigated. 



There the behavior of the rebuilding points were depended on the belonging of the ratio -^^^j to one of the three intervals 



n — 1 1 1 p 

< — , — < ^— 

n — 1 3' 3 n 



1 1 1 p-1 
T ^ 2' 2 ^ n-1 



(31) 
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It is easy to verify that for ui = M2 = ■ ■ • 
even t = 2r is examined in the same way. 



1 the inequahties (30) turn into inequahties (31). The case of an 
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